Top-Down Hierarchical Ensembles of Classifiers for Predicting G-Protein-Coupled-Receptor Functions
Authors
Abstract
Despite recent advances in molecular biology, the function of a large number of proteins is still unknown. One approach to predicting protein function is to search secondary databases, also known as signature databases. Protein signatures can be used for function prediction in several ways; a more sophisticated strategy is to induce a classification model. This paper applies six hierarchical classification methods to three protein functional classification datasets that employ protein signatures: five methods based on the standard Top-Down approach, and one based on a new approach, Top-Down Ensembles, which combines classifiers hierarchically. The algorithm based on Top-Down Ensembles produced slightly better results than the other algorithms, indicating that combining classifiers can improve the performance of hierarchical classification models.
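The abstract does not give the details of the Top-Down Ensembles algorithm, so the following is only a minimal sketch of the general idea, not the authors' method: a top-down hierarchical classifier that trains one local model per parent node of the class tree, where each local model is a majority-vote ensemble. The class names (`NearestCentroid`, `KNearest`, `VotingEnsemble`, `TopDownClassifier`), the choice of base classifiers, and the toy 2-D data are all assumptions made for illustration; real protein-signature data would use binary signature attributes.

```python
from collections import Counter


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((u - v) ** 2 for u, v in zip(a, b))


class NearestCentroid:
    """Base classifier: predicts the class whose mean vector is closest."""

    def fit(self, X, y):
        groups = {}
        for x, label in zip(X, y):
            groups.setdefault(label, []).append(x)
        self.centroids = {
            label: [sum(col) / len(rows) for col in zip(*rows)]
            for label, rows in groups.items()
        }
        return self

    def predict(self, x):
        return min(self.centroids, key=lambda c: sq_dist(x, self.centroids[c]))


class KNearest:
    """Base classifier: majority class among the k nearest training points."""

    def __init__(self, k):
        self.k = k

    def fit(self, X, y):
        self.data = list(zip(X, y))
        return self

    def predict(self, x):
        nearest = sorted(self.data, key=lambda p: sq_dist(x, p[0]))[: self.k]
        return Counter(label for _, label in nearest).most_common(1)[0][0]


class VotingEnsemble:
    """Combines several base classifiers by simple majority vote."""

    def __init__(self, members):
        self.members = members

    def fit(self, X, y):
        for member in self.members:
            member.fit(X, y)
        return self

    def predict(self, x):
        votes = Counter(member.predict(x) for member in self.members)
        return votes.most_common(1)[0][0]


class TopDownClassifier:
    """One local model per parent node; prediction walks from root to leaf."""

    def __init__(self, make_local):
        self.make_local = make_local  # factory returning an unfitted local model
        self.local = {}               # parent path (tuple) -> fitted local model

    def fit(self, X, paths):
        by_parent = {}
        for x, path in zip(X, paths):
            for depth in range(len(path)):
                xs, ys = by_parent.setdefault(path[:depth], ([], []))
                xs.append(x)            # example reaches this parent node
                ys.append(path[depth])  # target is the child class at this level
        for parent, (xs, ys) in by_parent.items():
            self.local[parent] = self.make_local().fit(xs, ys)
        return self

    def predict(self, x):
        path = ()
        while path in self.local:  # stop when no local model exists (leaf)
            path += (self.local[path].predict(x),)
        return path


# Toy two-level hierarchy (hypothetical data): A -> {A1, A2}, B -> {B1, B2}.
X = [(0, 0), (1, 0), (0, 5), (1, 5), (10, 0), (11, 0), (10, 5), (11, 5)]
paths = [("A", "A1"), ("A", "A1"), ("A", "A2"), ("A", "A2"),
         ("B", "B1"), ("B", "B1"), ("B", "B2"), ("B", "B2")]


def make_ensemble():
    return VotingEnsemble([NearestCentroid(), KNearest(1), KNearest(3)])


model = TopDownClassifier(make_ensemble).fit(X, paths)
print(model.predict((0.5, 4.5)))   # ('A', 'A2')
print(model.predict((10.2, 0.3)))  # ('B', 'B1')
```

Passing a single base classifier as the factory (for example `lambda: KNearest(1)`) gives a plain top-down classifier, which is one way to compare the two approaches on the same data.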
Similar resources
Hierarchical classification of G-Protein-Coupled Receptors with data-driven selection of attributes and classifiers
We address the important bioinformatics problem of predicting protein function from a protein's primary sequence. We consider the functional classification of G-Protein-Coupled Receptors (GPCRs), whose functions are specified in a class hierarchy. We tackle this task using a novel top-down hierarchical classification system where, for each node in the class hierarchy, the predictor attributes t...
On the hierarchical classification of G protein-coupled receptors
Motivation: G protein-coupled receptors (GPCRs) play an important role in many physiological systems by transducing an extracellular signal into an intracellular response. Over 50% of all marketed drugs are targeted towards a GPCR. There is considerable interest in developing an algorithm that could effectively predict the function of a GPCR from its primary sequence. Such an algorithm is useful...
Hierarchical classification of protein function with ensembles of rules and particle swarm optimisation
This paper focuses on hierarchical classification problems where the classes to be predicted are organized in the form of a tree. The standard top-down divide and conquer approach for hierarchical classification consists of building a hierarchy of classifiers where a classifier is built for each internal (non-leaf) node in the class tree. Each classifier discriminates only between its child cla...
Iterative Construction of Hierarchical Classifiers for Phishing Website Detection
This article is devoted to a new iterative construction of hierarchical classifiers in SimpleCLI for the detection of phishing websites. Our new construction of hierarchical systems creates ensembles of ensembles in SimpleCLI by iteratively linking a top-level ensemble to another middle-level ensemble instead of a base classifier so that the top-level ensemble can generate a large multilevel sy...
G-protein Coupled Receptor Dimerization
A growing body of evidence suggests that GPCRs exist and function as dimers or higher oligomers. The evidence for GPCR dimerization comes from biochemical, biophysical and functional studies. In addition, researchers have shown the occurrence of heterodimerization between different members of the GPCR family. Two receptors can interact with each other to make a dimer through their extracellular...